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(2) INFORMATION FOR SEQ ID NO : 1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7257 base pairs 

(B) TYPE: nucleic acid 





(C) STRANDEDNESS: double 

(D) Topology: linear 

(ii) MOLECULE TYPE: nucleic acid 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1072.. 4284 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 1 : 

TCAATATTGG CCATTAGCCA TATTATTCAT TGGTTATATA GCATAAATCA ATATTGGCTA 60 

TTGGCCATTG CATACGTTGT ATCTATATCA TAATATGTAC ATTTATATTG GCTCATGTCC 12 0 

AATATGACCG CCATGTTGGC ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG 180 

GTCATTAGTT C AT AGCC CAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC 24 0 

GCCTGGCTGA CCGCCCAACG ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT 3 00 

AGTAACGCCA ATAGGGACTT TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC 3 60 

CCACTTGGCA GTACATCAAG TGTATCATAT GCCAAGTCCG CCCCCTATTG ACGTCAATGA 42 0 

CGGTAAATGG CCCGCCTGGC ATTATGCCCA GTACATGACC TTACGGGACT TTCCTACTTG 480 

GCAGTACATC TACGTATTAG TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAC 54 0 

CAATGGGCGT GGATAGCGGT TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT 600 

CAATGGGAGT TTGTTTTGGC ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAATAACCC 660 

CGCCCCGTTG ACGCAAATGG GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGAGC 720 

TCGTTTAGTG AACCGTCAGA TCACTAGAAG CTTTATTGCG GTAGTTTATC ACAGTTAAAT 7 80 

TGCTAACGCA GTCAGTGCTT CTGACACAAC AGTCTCGAAC TTAAGCTGCA GAAGTTGGTC 840 

GTGAGGCACT GGGCAGGTAA GTATCAAGGT TACAAGACAG GTTTAAGGAG ACCAATAGA& 900 

ACTGGGCTTG TCGAGACAGA GAAGACTCTT GCGTTTCTGA TAGGCACCTA TTGGTCTTAC 960 

TGACATCCAC TTTGCCTTTC TCTCCACAGG TGTCCACTCC CAGTTCAATT ACAGCTCTTA 102 0 

AGGCTAGAGT ACTTAATACG ACTCACTATA GGCTAGCGAA GGAGATCCGC C ATG GCC 1077 



Met Ala 
1 



CAC CAT CAC CAC CAT CAC GGA TAT CCA TAC GAC 
His His His His His His Gly Tyr Pro Tyr Asp 
5 10 



GTG CCA GAT TAC GCT 
Val Pro Asp Tyr Ala 
15 



1125 



CAG TCG AGT GCC ATG AGT AAA GGA GAA GAA CTT 
Gin Ser Ser Ala Met Ser Lys Gly Glu Glu Leu 
20 25 



TTC ACT GGA GTT GTC 
Phe Thr Gly Val Val 
30 



1173 



CCA ATT CTT GTT GAA TTA GAT GGT GAT GTT AAT GGG CAC AAA TTT TCT 12 21 

Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser 
35 40 45 50 

GTC AGT GGA GAG GGT GAA GGT GAT GCA ACA TAC GGA AAA CTT ACC CTT 12 69 

Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu 
55 60 65 

AAA TTT ATT TGC ACT ACT GGA AAA CTA CCT GTT CCT TGG CCA ACA CTT 1317 
Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu 
70 75 80 

GTC ACT ACT TTC ACT TAT GGT GTT CAA TGC TTT TCA AGA TAC CCA GAT 1365 
Val Thr Thr Phe Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp 
85 90 95 

CAT ATG AAA CAG CAT GAC TTT TTC AAG AGT GCC ATG CCC GAA GGT TAT 1413 
His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr 
100 105 110 

GTA CAG GAA AGA ACT ATA TTT TTC AAA GAT GAC GGG AAC TAC AAG ACA 1461 
Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr 
115 120 125 130 

CGT GCT GAA GTC AAG TTT GAA GGT GAT ACC CTT GTT AAT AGA ATC GAG 1509 
Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu 
135 140 145 

TTA AAA GGT ATT GAT TTT AAA GAA GAT GGA AAC ATT CTT GGA CAC AAA 1557 
Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly His Lys 
150 155 160 

TTG GAA TAC AAC TAT AAC TCA CAC AAT GTA TAC ATC ATG GCA GAC AAA 1605 
Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys 
165 170 175 

CAA AAG AAT GGA ATC AAA GTT AAC TTC AAA ATT AGA CAC AAC ATT GAA 1653 
Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn lie Glu 
180 185 190 

GAT GGA AGC GTT CAA CTA GCA GAC CAT TAT CAA CAA AAT ACT CCA ATT 1701 
Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie 
195 200 205 210 

GGC GAT GGC CCT GTC CTT TTA CCA GAC AAC CAT TAC CTG TCC ACA CAA 174 9 

Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin 
215 220 225 

TCT GCC CTT TCG AAA GAT CCC AAC GAA AAG AGA GAC CAC ATG GTC CTT 17 97 

Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu 
230 235 240 

CTT GAG TTT GTA ACA GCT GCT GGG ATT ACA CAT GGC ATG GAT GAA CTA 1845 
Leu Glu Phe Val Thr Ala Ala Gly lie Thr His Gly Met Asp Glu Leu 
245 250 255 



TAC AAA GGC GCC GGC GCT GGT GCT GGT GCT GGC GCC ATC AGC GCG CTG 18 93 

Tyr Lys Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala lie Ser Ala Leu 
260 265 270 

ATC CTG GAC TCC AAA GAA TCC TTA GCT CCC CCT GGT AGA GAC GAA GTC 1941 

He Leu Asp Ser Lys Glu Ser Leu Ala Pro Pro Gly Arg Asp Glu Val 

275 280 285 290 

CCT GGC AGT TTG CTT GGC CAG GGG AGG GGG AGC GTA ATG GAC TTT TAT 1989 
Pro Gly Ser Leu Leu Gly Gin Gly Arg Gly Ser Val Met Asp Phe Tyr 
295 300 305 

AAA AGC CTG AGG GGA GGA GCT ACA GTC AAG GTT TCT GCA TCT TCG CCC 2037 

Lys Ser Leu Arg Gly Gly Ala Thr Val Lys Val Ser Ala Ser Ser Pro 
310 315 320 

TCA GTG GCT GCT GCT TCT CAG GCA GAT TCC AAG CAG CAG AGG ATT CTC 20 85 

Ser Val Ala Ala Ala Ser Gin Ala Asp Ser Lys Gin Gin Arg He Leu 
325 330 335 

CTT GAT TTC TCG AAA GGC TCC ACA AGC AAT GTG CAG CAG CGA CAG CAG 2133 

Leu Asp Phe Ser Lys Gly Ser Thr Ser Asn Val Gin Gin Arg Gin Gin 
340 345 350 

CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG CAG 2181 

Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 

355 360 365 370 

CAG CCA GGC TTA TCC AAA GCC GTT TCA CTG TCC ATG GGG CTG TAT ATG 22 29 

Gin Pro Gly Leu Ser Lys Ala Val Ser Leu Ser Met Gly Leu Tyr Met 
375 380 385 

GGA GAG ACA GAA ACA AAA GTG ATG GGG AAT GAC TTG GGC TAC CCA CAG 22 77 

Gly Glu Thr Glu Thr Lys Val Met Gly Asn Asp Leu Gly Tyr Pro Gin 
390 395 400 

CAG GGC CAA CTT GGC CTT TCC TCT GGG GAA ACA GAC TTT CGG CTT CTG 2325 

Gin Gly Gin Leu Gly Leu Ser Ser Gly Glu Thr Asp Phe Arg Leu Leu 
405 410 415 

GAA GAA AGC ATT GCA AAC CTC AAT AGG TCG ACC AGC GTT CCA GAG AAC 23 73 

Glu Glu Ser He Ala Asn Leu Asn Arg Ser Thr Ser Val Pro Glu Asn 
420 425 430 

CCC AAG AGT TCA ACG TCT GCA ACT GGG TGT GCT ACC CCG ACA GAG AAG 24 21 

Pro Lys Ser Ser Thr Ser Ala Thr Gly Cys Ala Thr Pro Thr Glu Lys 

435 440 445 450 

GAG TTT CCC AAA ACT CAC TCG GAT GCA TCT TCA GAA CAG CAA AAT CGA 24 69 

Glu Phe Pro Lys Thr His Ser Asp Ala Ser Ser Glu Gin Gin Asn Arg 
455 460 465 

AAA AGC CAG ACC GGC ACC AAC GGA GGC AGT GTG AAA TTG TAT CCC ACA 2517 

Lys Ser Gin Thr Gly Thr Asn Gly Gly Ser Val Lys Leu Tyr Pro Thr 



470 475 480 



GAC CAA AGC ACC TTT GAC CTC TTG AAG GAT TTG GAG TTT TCC GCT GGG 25 65 

Asp Gin Ser Thr Phe Asp Leu Leu Lys Asp Leu Glu Phe Ser Ala Gly 
485 490 495 

TCC CCA AGT AAA GAC AC A AAC GAG AGT CCC TGG AGA TCA GAT CTG TTG 2613 
Ser Pro Ser Lys Asp Thr Asn Glu Ser Pro Trp Arg Ser Asp Leu Leu 
500 505 510 

ATA GAT GAA AAC TTG CTT TCT CCT TTG GCG GGA GAA GAT GAT CCA TTC 2 661 

lie Asp Glu Asn Leu Leu Ser Pro Leu Ala Gly Glu Asp Asp Pro Phe 
515 520 525 530 

CTT CTC GAA GGG AAC ACG AAT GAG GAT TGT AAG CCT CTT ATT TTA CCG 27 09 

Leu Leu Glu Gly Asn Thr Asn Glu Asp Cys Lys Pro Leu lie Leu Pro 
535 540 545 

GAC ACT AAA CCT AAA ATT AAG GAT ACT GGA GAT ACA ATC TTA TCA AGT 2757 
Asp Thr Lys Pro Lys lie Lys Asp Thr Gly Asp Thr lie Leu Ser Ser 
550 555 560 

CCC AGC AGT GTG GCA CTA CCC CAA GTG AAA ACA GAA AAA GAT GAT TTC 2 805 

Pro Ser Ser Val Ala Leu Pro Gin Val Lys Thr Glu Lys Asp Asp Phe 
565 570 575 

ATT GAA CTT TGC ACC CCC GGG GTA ATT AAG CAA GAG AAA CTG GGC CCA 2 853 

lie Glu Leu Cys Thr Pro Gly Val lie Lys Gin Glu Lys Leu Gly Pro 
580 585 590 

GTT TAT TGT CAG GCA AGC TTT TCT GGG ACA AAT ATA ATT GGT AAT AAA 2 901 

Val Tyr Cys Gin Ala Ser Phe Ser Gly Thr Asn lie lie Gly Asn Lys 
595 600 605 610 

ATG TCT GCC ATT TCT GTT CAT GGT GTG AGT ACC TCT GGA GGA CAG ATG 2 94 9 

Met Ser Ala He Ser Val His Gly Val Ser Thr Ser Gly Gly Gin Met 
615 620 625 

TAC CAC TAT GAC ATG AAT ACA GCA TCC CTT TCT CAG CAG CAG GAT CAG 2 997 

Tyr His Tyr Asp Met Asn Thr Ala Ser Leu Ser Gin Gin Gin Asp Gin 
630 635 640 

AAG CCT GTT TTT AAT GTC ATT CCA CCA ATT CCT GTT GGT TCT GAA AAC 3 045 

Lys Pro Val Phe Asn Val He Pro Pro He Pro Val Gly Ser Glu Asn 
645 650 655 

TGG AAT AGG TGC CAA GGC TCC GGA GAG GAC AGC CTG ACT TCC TTG GGG 3 093 

Trp Asn Arg Cys Gin Gly Ser Gly Glu Asp Ser Leu Thr Ser Leu Gly 
660 665 670 

GCT CTG AAC TTC CCA GGC CGG TCA GTG TTT TCT AAT GGG TAC TCA AGC 3141 
Ala Leu Asn Phe Pro Gly Arg Ser Val Phe Ser Asn Gly Tyr Ser Ser 
675 680 685 690 

CCT GGA ATG AGA CCA GAT GTA AGC TCT CCT CCA TCC AGC TCG TCA GCA 318 9 



Pro Gly Met Arg Pro Asp Val Ser Ser Pro Pro Ser Ser Ser Ser Ala 
695 700 705 



GCC ACG GGA CCA CCT CCC AAG CTC TGC CTG GTG TGC TCC GAT GAA GCT 3237 

Ala Thr Gly Pro Pro Pro Lys Leu Cys Leu Val Cys Ser Asp Glu Ala 
710 715 720 

TCA GGA TGT CAT TAC GGG GTG CTG ACA TGT GGA AGC TGC AAA GTA TTC 32 85 

Ser Gly Cys His Tyr Gly Val Leu Thr Cys Gly Ser Cys Lys Val Phe 
725 730 735 

TTT AAA AGA GCA GTG GAA GGA CAG CAC AAT TAC CTT TGT GCT GGA AGA 3333 

Phe Lys Arg Ala Val Glu Gly Gin His Asn Tyr Leu Cys Ala Gly Arg 

740 745 750 

AAC GAT TGC ATC ATT GAT AAA ATT CGA AGG AAA AAC TGC CCA GCA TGC 33 81 

Asn Asp Cys lie lie Asp Lys lie Arg Arg Lys Asn Cys Pro Ala Cys 
755 760 765 770 

CGC TAT CGG AAA TGT CTT CAG GCT GGA ATG AAC CTT GAA GCT CGA AAA 342 9 

Arg Tyr Arg Lys Cys Leu Gin Ala Gly Met Asn Leu Glu Ala Arg Lys 
775 780 785 

ACA AAG AAA AAA ATC AAA GGG ATT CAG CAA GCC ACT GCA GGA GTC TCA 34 77 

Thr Lys Lys Lys He Lys Gly He Gin Gin Ala Thr Ala Gly Val Ser 
790 795 800 

CAA GAC ACT TCG GAA AAT CCT AAC AAA ACA ATA GTT CCT GCA GCA TTA 3 52 5 

Gin Asp Thr Ser Glu Asn Pro Asn Lys Thr He Val Pro Ala Ala Leu 
805 810 815 

CCA CAG CTC ACC CCT ACC TTG GTG TCA CTG CTG GAG GTG ATT GAA CCC 3573 

Pro Gin Leu Thr Pro Thr Leu Val Ser Leu Leu Glu Val He Glu Pro 

820 825 830 

GAG GTG TTG TAT GCA GGA TAT GAT AGC TCT GTT CCA GAT TCA GCA TGG 3 621 

Glu Val Leu Tyr Ala Gly Tyr Asp Ser Ser Val Pro Asp Ser Ala Trp 
835 840 845 850 

AGA ATT ATG ACC ACA CTC AAC ATG TTA GGT GGG CGT CAA GTG ATT GCA 3 669 

Arg He Met Thr Thr Leu Asn Met Leu Gly Gly Arg Gin Val He Ala 
855 860 865 

GCA GTG AAA TGG GCA AAG GCG ATA CTA GGC TTG AGA AAC TTA CAC CTC 3 717 

Ala Val Lys Trp Ala Lys Ala He Leu Gly Leu Arg Asn Leu His Leu 
870 875 880 

GAT GAC CAA ATG ACC CTG CTA CAG TAC TCA TGG ATG TTT CTC ATG GCA 3 7 65 

Asp Asp Gin Met Thr Leu Leu Gin Tyr Ser Trp Met Phe Leu Met Ala 
885 890 895 

TTT GCC TTG GGT TGG AGA TCA TAC AGA CAA TCA AGC GGA AAC CTG CTC 3 813 

Phe Ala Leu Gly Trp Arg Ser Tyr Arg Gin Ser Ser Gly Asn Leu Leu 

900 905 910 




TGC TTT GCT CCT GAT CTG ATT ATT AAT GAG CAG AGA ATG TCT CTA CCC 3 861 

Cys Phe Ala Pro Asp Leu lie lie Asn Glu Gin Arg Met Ser Leu Pro 
915 920 925 930 



GGC ATG TAT GAG CAA TGT AAA CAC ATG CTG TTT GTC TCC TCT GAA TTA 3 90 9 

Gly Met Tyr Asp Gin Cys Lys His Met Leu Phe Val Ser Ser Glu Leu 
935 940 945 

CAA AGA TTG CAG GTA TCC TAT GAA GAG TAT CTC TGT ATG AAA ACC TTA 3 957 

Gin Arg Leu Gin Val Ser Tyr Glu Glu Tyr Leu Cys Met Lys Thr Leu 
950 955 960 

CTG CTT CTC TCC TCA GTT CCT AAG GAA GGT CTG AAG AGC CAA GAG TTA 4 005 

Leu Leu Leu Ser Ser Val Pro Lys Glu Gly Leu Lys Ser Gin Glu Leu 
965 970 975 

TTT GAT GAG ATT CGA ATG ACT TAT ATC AAA GAG CTA GGA AAA GCC ATC 4 053 

Phe Asp Glu lie Arg Met Thr Tyr lie Lys Glu Leu Gly Lys Ala He 
980 985 990 



GTC AAA AGG GAA GGG AAC TCC AGT CAG AAC TGG CAA CGG TTT TAC CAA 4101 
Val Lys Arg Glu Gly Asn Ser Ser Gin Asn Trp Gin Arg Phe Tyr Gin 
995 1000 1005 1010 

CTG ACA AAG CTT CTG GAC TCC ATG CAT GAG GTG GTT GAG AAT CTC CTT 414 9 

Leu Thr Lys Leu Leu Asp Ser Met His Glu Val Val Glu Asn Leu Leu 
1015 1020 1025 

ACC TAC TGC TTC CAG ACA TTT TTG GAT AAG ACC ATG AGT ATT GAA TTC 4197 
Thr Tyr Cys Phe Gin Thr Phe Leu Asp Lys Thr Met Ser He Glu Phe 
1030 1035 1040 

CCA GAG ATG TTA GCT GAA ATC ATC ACT AAT CAG ATA CCA AAA TAT TCA 424 5 

Pro Glu Met Leu Ala Glu He He Thr Asn Gin He Pro Lys Tyr Ser 
1045 1050 1055 

AAT GGA AAT ATC AAA AAG CTT CTG TTT CAT CAA AAA TGA CTGCCTTACT 42 94 

Asn Gly Asn He Lys Lys Leu Leu Phe His Gin Lys * 
1060 1065 1070 

AAGAAAGGTT GCCTTAAAGA AAGTTGAATT TATAGTCTAG AGTCGACCCG GGCGGCCGCT 4354 

TCGAGCAGAC ATGATAAGAT ACATTGATGA GTTTGGACAA ACCACAACTA GAATGCAGTG 4414 

AAAAAAATGC TTTATTTGTG AAATTTGTGA TGCTATTGCT TTATTTGTAA CCATTATAAG 4474 

CTGCAATAAA CAAGTTAACA ACAACAATTG CATTCATTTT ATGTTTCAGG TTCAGGGGGA 4534 

GATGTGGGAG GTTTTTTAAA GCAAGTAAAA CCTCTACAAA TGTGGTAAAA TCGATAAGGA 4594 

TCCGGGCTGG CGTAATAGCG AAGAGGCCCG CACCGATCGC CCTTCCCAAC AGTTGCGCAG 4 654 

CCTGAATGGC GAATGGACGC GCCCTGTAGC GGCGCATTAA GCGCGGCGGG TGTGGTGGTT 4714 




ACGCGCAGCG TGACCGCTAC ACTTGCCAGC GCCCTAGCGC CCGCTCCTTT CGCTTTCTTC 47 74 

CCTTCCTTTC TCGCCACGTT CGCCGGCTTT CCCCGTCAAG CTCTAAATCG GGGGCTCCCT 4 834 

TTAGGGTTCC GATTTAGAGC TTTACGGCAC CTCGACCGCA AAAAACTTGA TTTGGGTGAT 4 8 94 

GGTTCACGTA GTGGGCCATC GCCCTGATAG ACGGTTTTTC GCCCTTTGAC GTTGGAGTCC 4 954 

ACGTTCTTTA ATAGTGGACT CTTGTTCCAA ACTGGAACAA CACTCAACCC TATCTCGGTC 5014 

TATTCTTTTG ATTTATAAGG GATTTTGCCG ATTTCGGCCT ATTGGTTAAA AAATGAGCTG 5074 

ATTTAACAAA TATTTAACGC GAATTTTAAC AAAATATTAA CGTTTACAAT TTCGCCTGAT 5134 

GCGGTATTTT CTCCTTACGC ATCTGTGCGG TATTTCACAC CGCATATGGT GCACTCTCAG 5194 

TACAATCTGC TCTGATGCCG CATAGTTAAG CCAGCCCCGA CACCCGCCAA CACCCGCTGA 5254 

CGCGCCCTGA CGGGCTTGTC TGCTCCCGGC ATCCGCTTAC AGACAAGCTG TGACCGTCTC 5314 

CGGGAGCTGC ATGTGTCAGA GGTTTTCACC GTCATCACCG AAACGCGCGA GACGAAAGGG 53 74 

CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT CTTAGACGTC 54 34 

AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT TCTAAATACA 54 94 

TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT AATATTGAAA 5554 

AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT TTGCGGCATT 5614 

TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG CTGAAGATCA 5674 

GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA TCCTTGAGAG 57 34 

TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC TATGTGGCGC 57 94 

GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC ACTATTCTCA 5854 

GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG GCATGACAGT 5914 

AAGAGAATTA TGCAGTGCTG C C AT AAC CAT GAGTGATAAC ACTGCGGCCA ACTTACTTCT 5974 

GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG GGGATCATGT 6034 

AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC AT AC CAAACG ACGAGCGTGA 60 94 

CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG GCGAACTACT 6154 

TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG TTGCAGGACC 6214 

ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG GAGCCGGTGA 62 74 

GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT CCCGTATCGT 6334 

AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC AGATCGCTGA 63 94 

GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT CATATATACT 64 54 




TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA TCCTTTTTGA 6514 

TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT CAGACCCCGT 65 74 

AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT GCTGCTTGCA 6634 

AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC TACCAACTCT 6694 

TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC TTCTAGTGTA 6754 

GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC TCGCTCTGCT 6814 

AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG GGTTGGACTC 68 74 

AAGACGATAG TTAC CGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT CGTGCACACA 6934 

GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG AGCTATGAGA 6994 

AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG GCAGGGTCGG 7054 

AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT ATAGTCCTGT 7114 

CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG GGGGGCGGAG 7174 

CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT GCTGGCCTTT 72 34 

TGCTCACATG GCTCGACAGA TCT 72 57 

(2) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1071 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 2 : 

Met Ala His His His His His His Gly Tyr Pro Tyr Asp Val Pro Asp 
15 10 15 

Tyr Ala Gin Ser Ser Ala Met Ser Lys Gly Glu Glu Leu Phe Thr Gly 
20 25 30 

Val Val Pro lie Leu Val Glu Leu Asp Gly Asp Val Asn Gly His Lys 
35 40 45 

Phe Ser Val Ser Gly Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu 
50 55 60 

Thr Leu Lys Phe lie Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro 
65 70 75 80 

Thr Leu Val Thr Thr Phe Thr Tyr Gly Val Gin Cys Phe Ser Arg Tyr 



85 90 95 

Pro Asp His Met Lys Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu 
100 105 110 

Gly Tyr Val Gin Glu Arg Thr lie Phe Phe Lys Asp Asp Gly Asn Tyr 
115 120 125 

Lys Thr Arg Ala Glu Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg 
130 135 140 

lie Glu Leu Lys Gly lie Asp Phe Lys Glu Asp Gly Asn lie Leu Gly 
145 150 155 160 

His Lys Leu Glu Tyr Asn Tyr Asn Ser His Asn Val Tyr lie Met Ala 
165 170 175 

Asp Lys Gin Lys Asn Gly lie Lys Val Asn Phe Lys lie Arg His Asn 
180 185 190 

lie Glu Asp Gly Ser Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr 
195 200 205 

Pro lie Gly Asp Gly Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser 
210 215 220 

Thr Gin Ser Ala Leu Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met 
225 230 235 240 

Val Leu Leu Glu Phe Val Thr Ala Ala Gly He Thr His Gly Met Asp 
245 250 255 

Glu Leu Tyr Lys Gly Ala Gly Ala Gly Ala Gly Ala Gly Ala He Ser 
260 265 270 

Ala Leu He Leu Asp Ser Lys Glu Ser Leu Ala Pro Pro Gly Arg Asp 
275 280 285 

Glu Val Pro Gly Ser Leu Leu Gly Gin Gly Arg Gly Ser Val Met Asp 
290 295 300 

Phe Tyr Lys Ser Leu Arg Gly Gly Ala Thr Val Lys Val Ser Ala Ser 
305 310 315 320 

Ser Pro Ser Val Ala Ala Ala Ser Gin Ala Asp Ser Lys Gin Gin Arg 
325 330 335 

He Leu Leu Asp Phe Ser Lys Gly Ser Thr Ser Asn Val Gin Gin Arg 
340 345 350 

Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin Gin 
355 360 365 

Gin Gin Gin Pro Gly Leu Ser Lys Ala Val Ser Leu Ser Met Gly Leu 
370 375 380 

Tyr Met Gly Glu Thr Glu Thr Lys Val Met Gly Asn Asp Leu Gly Tyr 



385 

Pro Gin Gin Gly 

Leu Leu Glu Glu 
420 

Glu Asn Pro Lys 
435 




390 

Gin Leu Gly Leu 
405 

Ser lie Ala Asn 

Ser Ser Thr Ser 
440 



395 

Ser Ser Gly Glu 
410 

Leu Asn Arg Ser 
425 

Ala Thr Gly Cys 




400 

Thr Asp Phe Arg 
415 

Thr Ser Val Pro 
430 

Ala Thr Pro Thr 
445 



Glu Lys Glu Phe Pro Lys Thr His Ser Asp Ala Ser Ser Glu Gin Gin 
450 455 460 

Asn Arg Lys Ser Gin Thr Gly Thr Asn Gly Gly Ser Val Lys Leu Tyr 
465 470 475 480 

Pro Thr Asp Gin Ser Thr Phe Asp Leu Leu Lys Asp Leu Glu Phe Ser 
485 490 495 

Ala Gly Ser Pro Ser Lys Asp Thr Asn Glu Ser Pro Trp Arg Ser Asp 
500 505 510 

Leu Leu lie Asp Glu Asn Leu Leu Ser Pro Leu Ala Gly Glu Asp Asp 
515 520 525 

Pro Phe Leu Leu Glu Gly Asn Thr Asn Glu Asp Cys Lys Pro Leu lie 
530 535 540 

Leu Pro Asp Thr Lys Pro Lys lie Lys Asp Thr Gly Asp Thr lie Leu 
545 550 555 560 

Ser Ser Pro Ser Ser Val Ala Leu Pro Gin Val Lys Thr Glu Lys Asp 
565 570 575 

Asp Phe lie Glu Leu Cys Thr Pro Gly Val lie Lys Gin Glu Lys Leu 
580 585 590 

Gly Pro Val Tyr Cys Gin Ala Ser Phe Ser Gly Thr Asn lie He Gly 
595 600 605 

Asn Lys Met Ser Ala He Ser Val His Gly Val Ser Thr Ser Gly Gly 
610 615 620 

Gin Met Tyr His Tyr Asp Met Asn Thr Ala Ser Leu Ser Gin Gin Gin 
625 630 635 640 

Asp Gin Lys Pro Val Phe Asn Val He Pro Pro He Pro Val Gly Ser 
645 650 655 

Glu Asn Trp Asn Arg Cys Gin Gly Ser Gly Glu Asp Ser Leu Thr Ser 
660 665 670 



Leu Gly Ala Leu Asn Phe Pro Gly Arg Ser Val Phe Ser Asn Gly Tyr 
675 680 685 



Ser Ser Pro Gly Met Arg Pro Asp Val Ser Ser Pro Pro Ser Ser Ser 
690 695 700 

Ser Ala Ala Thr Gly Pro Pro Pro Lys Leu Cys Leu Val Cys Ser Asp 
705 710 715 720 

Glu Ala Ser Gly Cys His Tyr Gly Val Leu Thr Cys Gly Ser Cys Lys 
725 730 735 

Val Phe Phe Lys Arg Ala Val Glu Gly Gin His Asn Tyr Leu Cys Ala 
740 745 750 



Gly Arg Asn Asp Cys lie lie Asp Lys lie Arg Arg Lys Asn Cys Pro 
755 760 765 

Ala Cys Arg Tyr Arg Lys Cys Leu Gin Ala Gly Met Asn Leu Glu Ala 
770 775 780 

Arg Lys Thr Lys Lys Lys lie Lys Gly lie Gin Gin Ala Thr Ala Gly 
785 790 795 800 

Val Ser Gin Asp Thr Ser Glu Asn Pro Asn Lys Thr lie Val Pro Ala 
805 810 815 

Ala Leu Pro Gin Leu Thr Pro Thr Leu Val Ser Leu Leu Glu Val lie 
820 825 830 

Glu Pro Glu Val Leu Tyr Ala Gly Tyr Asp Ser Ser Val Pro Asp Ser 
835 840 845 

Ala Trp Arg lie Met Thr Thr Leu Asn Met Leu Gly Gly Arg Gin Val 
850 855 860 

lie Ala Ala Val Lys Trp Ala Lys Ala lie Leu Gly Leu Arg Asn Leu 
865 870 875 880 

His Leu Asp Asp Gin Met Thr Leu Leu Gin Tyr Ser Trp Met Phe Leu 
885 890 895 

Met Ala Phe Ala Leu Gly Trp Arg Ser Tyr Arg Gin Ser Ser Gly Asn 
900 905 910 

Leu Leu Cys Phe Ala Pro Asp Leu lie lie Asn Glu Gin Arg Met Ser 
915 920 925 

Leu Pro Gly Met Tyr Asp Gin Cys Lys His Met Leu Phe Val Ser Ser 
930 935 940 

Glu Leu Gin Arg Leu Gin Val Ser Tyr Glu Glu Tyr Leu Cys Met Lys 
945 950 955 960 

Thr Leu Leu Leu Leu Ser Ser Val Pro Lys Glu Gly Leu Lys Ser Gin 
965 970 975 

Glu Leu Phe Asp Glu lie Arg Met Thr Tyr lie Lys Glu Leu Gly Lys 
980 985 990 



Ala lie Val Lys Arg Glu Gly Asn Ser Ser Gin Asn Trp Gin Arg Phe 
995 1000 1005 



Tyr Gin Leu Thr Lys Leu Leu Asp Ser Met His Glu Val Val Glu Asn 
1010 1015 1020 

Leu Leu Thr Tyr Cys Phe Gin Thr Phe Leu Asp Lys Thr Met Ser lie 
1025 1030 1035 1040 

Glu Phe Pro Glu Met Leu Ala Glu He He Thr Asn Gin He Pro Lys 

1045 1050 1055 



Tyr Ser Asn Gly Asn He Lys Lys Leu Leu Phe His Gin Lys * 
1060 1065 1070 



(2) INFORMATION FOR SEQ ID NO : 3 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: nucleic acid 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3 : 
GCGCGCTGAT CAGAATTCCT TTTAGGAATT CTGATCAGCG CGCTGA 



(2) INFORMATION FOR SEQ ID NO : 4 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4 : 
AGAACANNNT GTTCT 

(2) INFORMATION FOR SEQ ID NO : 5 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 





(ii) MOLECULE TYPE: oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5 : 



AGGTCANNNT GACCT 



15 



(2) INFORMATION FOR SEQ ID NO : 6 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 6 : 
TCGAGCOCGC AAGAACACAG TGTTCTGACG ACACGAAGAA CAGGATGTTC TCGTACAGTG 60 



(2) INFORMATION FOR SEQ ID NO : 7 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7 : 
TCGACACTGT ACGAGAACAT CCTGTTCTTC GTGTCGTCAG AACACTGTGT TCTTGCGCGC 60 



(2) INFORMATION FOR SEQ ID NO : 8 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 0 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: oligonucleotide 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



TCGAGCGCGC AAGGTCACAG TGACCTGACG ACACGAAGGT CAGGATGACC TCGTACAGTG 



60 




(2) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9 : 



TCGACACTGT ACGAGGTCAT CCTGACCTTC GTGTCGTCAG GTCACTGTGA CCTTGCGCGC 



60 
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